Shujian Huang

Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training

Feb 05, 2026

PEGRL: Improving Machine Translation by Post-Editing Guided Reinforcement Learning

Feb 03, 2026

Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers

Jan 29, 2026

Align to the Pivot: Dual Alignment with Self-Feedback for Multilingual Math Reasoning

Jan 25, 2026

Making Mathematical Reasoning Adaptive

Oct 06, 2025

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Aug 20, 2025

How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective

May 27, 2025

PATS: Process-Level Adaptive Thinking Mode Switching

May 25, 2025

Internal Bias in Reasoning Models leads to Overthinking

May 22, 2025

Why Not Act on What You Know? Unleashing Safety Potential of LLMs via Self-Aware Guard Enhancement

May 17, 2025